SVM-based Voice Activity Detection for Distributed Specch Recognition System
نویسندگان
چکیده
Voice Activity Detection (VAD) algorithms based on machine learning techniques have shown competitive results in the area of automatic speech recognition. This paper describes a new approach of VAD based on Support Vector Machines (SVM) for Distributed Speech Recognition (DSR) system. In the proposed scheme, the speech and the non-speech frames are detected from the compressed Mel Frequency Cepstral Coefficients (MFCCs), at the back-end (e.g. server) side, with the aim of improving the VAD performance and reducing the compression bit-rate from the front-end side (e.g. client). By using the trained SVM with polynomial kernel, the SVM-based VAD show an encouraging detection results. The classification task conducted from the Aurora-2 speech database, using different noise conditions, illustrates comparable VAD performance, with respect to ETSI Advanced Front-End (ETSI-AFE) VAD algorithm. Keywordsvoice acivity detection; support vector machines; DSR system; mel frequency cepstral coefficients
منابع مشابه
An Intelligent Anti-Islanding Scheme for Synchronous-Based Distributed Generation Using Reduced-Noise Morphological Gradient
This paper presents a pattern recognition-based scheme for detection of islanding conditions in synchronous- based distributed generation (DG) systems. The main idea behind the proposed scheme is the use of spatial features of system parameters such as the frequency, magnitude of positive sequence voltage, etc. In this study, the system parameters sampled at the point of common coupling (PCC) w...
متن کاملSVM-Enabled Voice Activity Detection
Detecting the presence of speech in a noisy signal is an unsolved problem affecting numerous speech processing applications. This paper shows an effective method employing support vector machines (SVM) for voice activity detection (VAD) in noisy environments. The use of kernels in SVM enables to map the data into some other dot product space (called feature space) via a nonlinear transformation...
متن کاملVoice Activity Detection Using MFCC Features and Support Vector Machine
We define voice activity detection (VAD) as a binary classification problem and solve it using the support vector machine (SVM). Challenges in SVM-based approach include selection of representative training segments, selection of features, normalization of the features, and post-processing of the frame-level decisions. We propose to construct a SVMVAD using MFCC features because they capture th...
متن کاملVoice-based Age and Gender Recognition using Training Generative Sparse Model
Abstract: Gender recognition and age detection are important problems in telephone speech processing to investigate the identity of an individual using voice characteristics. In this paper a new gender and age recognition system is introduced based on generative incoherent models learned using sparse non-negative matrix factorization and atom correction post-processing method. Similar to genera...
متن کاملSpeech Event Detection Using Support Vector Machines
An effective speech event detector is presented in this work for improving the performance of speech processing systems working in noisy environment. The proposed method is based on a trained support vector machine (SVM) that defines an optimized non-linear decision rule involving the subband SNRs of the input speech. It is analyzed the classification rule in the input space and the ability of ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015